Method for distinguishing AML-specific FLT3 length mutations from TKD mutations



The present invention is directed to a method for distinguishing AML-specific FLT3 length
mutations from TKD mutations by determining the expression level of selected marker
genes.

Leukemias are classified into four different groups or types: acute myeloid (AML), acute
lymphatic (ALL), chronic myeloid (CML) and chronic lymphatic leukemia (CLL). Within
these groups, several subcategories can be identified further using a panel of standard
techniques as described below. These different subcategories in leukemias are associated
with varying clinical outcome and therefore are the basis for different treatment strategies.
The importance of highly specific classification may be illustrated in detail further for
AML as a very heterogeneous group of diseases. Effort is aimed at identifying biological
entities and at distinguishing and classifying subgroups of AML which are associated with a
favorable, intermediate or unfavorable prognosis, respectively. In 1976, the FAB
classification was proposed by the French-American-British co-operative group which was
based on cytomorphology and cytochemistry in order to separate AML subgroups
according to the morphological appearance of blasts in the blood and bone marrow. In
addition, it was recognized that genetic abnormalities occurring in the leukemic blast had a
major impact on the morphological picture and even more on the prognosis. So far, the
karyotype of the leukemic blasts is the most important independent prognostic factor
regarding response to therapy as well as survival.



Usually, a combination of methods is necessary to obtain the most important information
in leukemia diagnostics: Analysis of the morphology and cytochemistry of bone marrow
blasts and peripheral blood cells is necessary to establish the diagnosis. In some cases the
addition of immunophenotyping is mandatory to separate very undifferentiated AML from
acute lymphoblastic leukemia and CLL. Leukemia subtypes investigated can be diagnosed
by cytomorphology alone, only if an expert reviews the smears. However, a genetic
analysis based on chromosome analysis, fluorescence in situ hybridization or RT-PCR and
immunophenotyping is required in order to assign all cases into the right category. The aim
of these techniques besides diagnosis is mainly to determine the prognosis of the leukemia.
A major disadvantage of these methods, however, is that viable cells are necessary as the
cells for genetic analysis have to divide in vitro in order to obtain metaphases for the
analysis. Another problem is the long time of 72 hours from receipt of the material in the
laboratory to obtain the result. Furthermore, great experience in preparation of
chromosomes and even more in analyzing the karyotypes is required to obtain the correct
result in at least 90% of cases. Using these techniques in combination, hematological
malignancies in a first approach are separated into chronic myeloid leukemia (CML),
chronic lymphatic (CLL), acute lymphoblastic (ALL), and acute myeloid leukemia (AML).
Within the latter three disease entities several prognostically relevant subtypes have been
established. As a second approach this further sub-classification is based mainly on genetic
abnormalities of the leukemic blasts and clearly is associated with different prognoses.



The sub-classification of leukemias becomes increasingly important to guide therapy. The
development of new, specific drugs and treatment approaches requires the identification of
specific subtypes that may benefit from a distinct therapeutic protocol and, thus, can
improve outcome of distinct subsets of leukemia. For example, the new therapeutic drug
(STI571, Imatinib) inhibits the CML specific chimeric tyrosine kinase BCR-ABL
generated from the genetic defect observed in CML, the BCR-ABL-rearrangement due to
the translocation between chromosomes 9 and 22 (t(9;22)(q34;q11)). In patients treated
with this new drug, the therapy response is dramatically higher as compared to all other
drugs that had been used so far. Another example is the subtype of acute myeloid leukemia
AML M3 and its variant M3v both with karyotype t(15;17)(q22;q11-12). The introduction
of a new drug (all-trans retinoic acid - ATRA) has improved the outcome in this subgroup
of patients from about 50% to 85% long-term survivors. As it is mandatory for these
patients suffering from these specific leukemia subtypes to be identified as fast as possible
so that the best therapy can be applied, diagnostics today must accomplish sub-classification
with maximal precision. Not only for these subtypes but also for several
other leukemia subtypes different treatment approaches could improve outcome.



Therefore, rapid and precise identification of distinct leukemia subtypes is the future goal 
for diagnostics. 

Thus, the technical problem underlying the present invention was to provide means for 
leukemia diagnostics which overcome at least some of the disadvantages of the prior art 
diagnostic methods, in particular encompassing the time-consuming and unreliable
combination of different methods and which provides a rapid assay to unambiguously 
distinguish one AML subtype from another, e.g. by genetic analysis. 

According to Golub et al. (Science, 1999, 286, 531-7), gene expression profiles can be 
used for class prediction and discriminating AML from ALL samples. However, for the 
analysis of acute leukemias the selection of the two different subgroups was performed 
using exclusively morphologic-phenotypical criteria. This was only descriptive and does 
not provide deeper insights into the pathogenesis or the underlying biology of the 
leukemia. The approach reproduces only very basic knowledge of cytomorphology and 
intends to differentiate classes. The data is not sufficient to predict prognostically relevant 
cytogenetic aberrations.

Furthermore, the international application WO-A 03/039443 discloses marker genes the 
expression levels of which are characteristic for certain leukemia, e.g. AML subtypes and 
additionally discloses methods for differentiating between the subtype of AML cells by 
determining the expression profile of the disclosed marker genes. However, WO-A
03/039443 does not provide guidance as to which set of distinct genes discriminates between two
subtypes and, as such, can be routinely used in order to distinguish one AML subtype
from another.

The problem is solved by the present invention, which provides a method for 
distinguishing AML-specific FLT3 length mutations from TKD mutations in a sample, the 
method comprising determining the expression level of markers selected from the markers 
identifiable by their Affymetrix Identification Numbers (affy ID) as defined in Tables 1
and/or 2,

wherein 



a lower expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 1.1 having a negative fc value, and/or

a higher expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 1.1 having a positive fc value,

is indicative for the presence of AML_D835 when AML_D835 is distinguished
from all other subtypes,

and/or wherein

a lower expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 1.2 having a negative fc value, and/or

a higher expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 1.2 having a positive fc value,

is indicative for the presence of AML_Double when AML_Double is distinguished
from all other subtypes,

and/or wherein

a lower expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 1.3 having a negative fc value, and/or

a higher expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 1.3 having a positive fc value,

is indicative for the presence of AML_Status-1 when AML_Status-1 is distinguished
from all other subtypes,

and/or wherein

a lower expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 1.4 having a negative fc value, and/or

a higher expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 1.4 having a positive fc value,

is indicative for the presence of AML_Status-2 when AML_Status-2 is distinguished
from all other subtypes,

and/or wherein

a lower expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 1.5 having a negative fc value, and/or

a higher expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 1.5 having a positive fc value,

is indicative for the presence of AML_Status-3 when AML_Status-3 is distinguished
from all other subtypes,

and/or wherein

a lower expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 1.6 having a negative fc value, and/or

a higher expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 1.6 having a positive fc value,

is indicative for the presence of AML_Status-4 when AML_Status-4 is distinguished
from all other subtypes,

and/or wherein

a lower expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 1.7 having a negative fc value, and/or

a higher expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 1.7 having a positive fc value,

is indicative for the presence of AML_Status-5 when AML_Status-5 is distinguished
from all other subtypes,

and/or wherein

a lower expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 1.8 having a negative fc value, and/or

a higher expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 1.8 having a positive fc value,

is indicative for the presence of AML_normal when AML_normal is distinguished
from all other subtypes,

and/or wherein 

a lower expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 2.1 having a negative fc value, and/or

a higher expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 2.1 having a positive fc value,

is indicative for the presence of AML_D835 when AML_D835 is distinguished
from AML_Double,

and/or wherein

a lower expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 2.2 having a negative fc value, and/or

a higher expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 2.2 having a positive fc value,

is indicative for the presence of AML_D835 when AML_D835 is distinguished
from AML_Status-1,

and/or wherein

a lower expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 2.3 having a negative fc value, and/or

a higher expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 2.3 having a positive fc value,

is indicative for the presence of AML_D835 when AML_D835 is distinguished
from AML_Status-2,

and/or wherein

a lower expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 2.4 having a negative fc value, and/or

a higher expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 2.4 having a positive fc value,

is indicative for the presence of AML_D835 when AML_D835 is distinguished
from AML_Status-3,

and/or wherein

a lower expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 2.5 having a negative fc value, and/or

a higher expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 2.5 having a positive fc value,

is indicative for the presence of AML_D835 when AML_D835 is distinguished
from AML_Status-4,

and/or wherein

a lower expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 2.6 having a negative fc value, and/or

a higher expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 2.6 having a positive fc value,

is indicative for the presence of AML_D835 when AML_D835 is distinguished
from AML_Status-5,

and/or wherein

a lower expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 2.7 having a negative fc value, and/or

a higher expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 2.7 having a positive fc value,

is indicative for the presence of AML_D835 when AML_D835 is distinguished
from AML_normal,

and/or wherein

a lower expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 2.8 having a negative fc value, and/or

a higher expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 2.8 having a positive fc value,

is indicative for the presence of AML_Double when AML_Double is distinguished
from AML_Status-1,

and/or wherein

a lower expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 2.9 having a negative fc value, and/or

a higher expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 2.9 having a positive fc value,

is indicative for the presence of AML_Double when AML_Double is distinguished
from AML_Status-2,

and/or wherein

a lower expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 2.10 having a negative fc value, and/or

a higher expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 2.10 having a positive fc value,

is indicative for the presence of AML_Double when AML_Double is distinguished
from AML_Status-3,

and/or wherein

a lower expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 2.11 having a negative fc value, and/or

a higher expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 2.11 having a positive fc value,

is indicative for the presence of AML_Double when AML_Double is distinguished
from AML_Status-4,

and/or wherein

a lower expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 2.12 having a negative fc value, and/or

a higher expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 2.12 having a positive fc value,

is indicative for the presence of AML_Double when AML_Double is distinguished
from AML_Status-5,

and/or wherein

a lower expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 2.13 having a negative fc value, and/or

a higher expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 2.13 having a positive fc value,

is indicative for the presence of AML_Double when AML_Double is distinguished
from AML_normal,

and/or wherein 

a lower expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 2.14 having a negative fc value, and/or

a higher expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 2.14 having a positive fc value,

is indicative for the presence of AML_Status-1 when AML_Status-1 is distinguished
from AML_Status-2,

and/or wherein

a lower expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 2.15 having a negative fc value, and/or

a higher expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 2.15 having a positive fc value,

is indicative for the presence of AML_Status-1 when AML_Status-1 is distinguished
from AML_Status-3,

and/or wherein

a lower expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 2.16 having a negative fc value, and/or

a higher expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 2.16 having a positive fc value,

is indicative for the presence of AML_Status-1 when AML_Status-1 is distinguished
from AML_Status-4,

and/or wherein

a lower expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 2.17 having a negative fc value, and/or

a higher expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 2.17 having a positive fc value,

is indicative for the presence of AML_Status-1 when AML_Status-1 is distinguished
from AML_Status-5,

and/or wherein

a lower expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 2.18 having a negative fc value, and/or

a higher expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 2.18 having a positive fc value,

is indicative for the presence of AML_Status-1 when AML_Status-1 is distinguished
from AML_normal,

and/or wherein 

a lower expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 2.19 having a negative fc value, and/or

a higher expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 2.19 having a positive fc value,

is indicative for the presence of AML_Status-2 when AML_Status-2 is distinguished
from AML_Status-3,

and/or wherein

a lower expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 2.20 having a negative fc value, and/or

a higher expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 2.20 having a positive fc value,

is indicative for the presence of AML_Status-2 when AML_Status-2 is distinguished
from AML_Status-4,

and/or wherein

a lower expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 2.21 having a negative fc value, and/or

a higher expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 2.21 having a positive fc value,

is indicative for the presence of AML_Status-2 when AML_Status-2 is distinguished
from AML_Status-5,

and/or wherein

a lower expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 2.22 having a negative fc value, and/or

a higher expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 2.22 having a positive fc value,

is indicative for the presence of AML_Status-2 when AML_Status-2 is distinguished
from AML_normal,

and/or wherein

a lower expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 2.23 having a negative fc value, and/or

a higher expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 2.23 having a positive fc value,

is indicative for the presence of AML_Status-3 when AML_Status-3 is distinguished
from AML_Status-4,

and/or wherein

a lower expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 2.24 having a negative fc value, and/or

a higher expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 2.24 having a positive fc value,

is indicative for the presence of AML_Status-3 when AML_Status-3 is distinguished
from AML_Status-5,

and/or wherein

a lower expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 2.25 having a negative fc value, and/or

a higher expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 2.25 having a positive fc value,

is indicative for the presence of AML_Status-3 when AML_Status-3 is distinguished
from AML_normal,

and/or wherein 

a lower expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 2.26 having a negative fc value, and/or

a higher expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 2.26 having a positive fc value,

is indicative for the presence of AML_Status-4 when AML_Status-4 is distinguished
from AML_Status-5,

and/or wherein

a lower expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 2.27 having a negative fc value, and/or

a higher expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 2.27 having a positive fc value,

is indicative for the presence of AML_Status-4 when AML_Status-4 is distinguished
from AML_normal,

and/or wherein

a lower expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 2.28 having a negative fc value, and/or

a higher expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 2.28 having a positive fc value,

is indicative for the presence of AML_Status-5 when AML_Status-5 is distinguished
from AML_normal.
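
The numbering of the Tables referred to above follows a regular pattern: Tables 1.1 to 1.8 each compare one subtype with all other subtypes, and Tables 2.1 to 2.28 cover every pair of the eight subtypes in the order in which the subtypes are listed. As a minimal sketch in Python (the names `SUBTYPES`, `one_vs_all_table` and `pairwise_table` are illustrative and not part of the invention), the mapping can be reproduced as follows:

```python
from itertools import combinations

# Subtype labels in the order in which the Tables enumerate them.
SUBTYPES = [
    "AML_D835", "AML_Double",
    "AML_Status-1", "AML_Status-2", "AML_Status-3",
    "AML_Status-4", "AML_Status-5", "AML_normal",
]

# Tables 1.1-1.8: one subtype distinguished from all other subtypes.
ONE_VS_ALL = {s: f"1.{i}" for i, s in enumerate(SUBTYPES, start=1)}

# Tables 2.1-2.28: the first subtype distinguished from the second subtype.
PAIRWISE = {
    pair: f"2.{i}"
    for i, pair in enumerate(combinations(SUBTYPES, 2), start=1)
}


def one_vs_all_table(subtype):
    """Table number for `subtype` versus all other subtypes."""
    return ONE_VS_ALL[subtype]


def pairwise_table(first, second):
    """Table number for `first` distinguished from `second`.

    The pair must be given in the listed order; the reverse order
    raises KeyError because no separate Table exists for it.
    """
    return PAIRWISE[(first, second)]
```

For example, `pairwise_table("AML_Double", "AML_Status-1")` yields the Table number of the comparison of AML_Double with AML_Status-1.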

FLT3 stands for FMS-like tyrosine kinase 3. TKD stands for the tyrosine kinase domain of
FLT3.

Two major types of mutations in the FLT3 gene have been described.

1) Length mutations in the juxtamembrane domain (FLT3-LM, status 1, 2, 3, 4, 5)

2) Point mutations in codons D835 or I836 or deletions of I836 in the tyrosine kinase
domain (all coded as D835, refers to TKD mutation).

3) "Double" means that both types of mutations were found in a single patient.

As used herein, the abbreviations used above apply for the following AML subtypes (see
also Example 1):

1) AML_normal (normal karyotype) and no FLT3 mutation,

2) AML_status 1: FLT3-LM/WT (wildtype) ratio <03, 

3) AML_status 2: ratio 0.7-1.1,

4) AML_status 3: ratio of >1.2 = partial loss of WT (wild type),

5) AML_status 4: total loss of WT,

6) AML_status 5: two or more low status mutations,

7) AML_D835: D835/TKD mutation (mutation in the tyrosine kinase domain of
FLT3),

8) AML_Double: mutations D835/TKD and FLT3-LM.

As used herein, "all other subtypes" refer to the subtypes of the present invention, i.e. if
one subtype is distinguished from "all other subtypes", it is distinguished from all other
subtypes contained in the present invention.

According to the present invention, a "sample" means any biological material containing
genetic information in the form of nucleic acids or proteins obtainable or obtained from an
individual. The sample includes e.g. tissue samples, cell samples, bone marrow and/or
body fluids such as blood, saliva, semen. Preferably, the sample is blood or bone marrow,
more preferably the sample is bone marrow. The person skilled in the art is aware of
methods for isolating nucleic acids and proteins from a sample. A general method for
isolating and preparing nucleic acids from a sample is outlined in Example 3.

According to the present invention, the term "lower expression" is generally assigned to all
polynucleotides, definable by numbers and Affymetrix ID, the t-values and fold change (fc)
values of which are negative, as indicated in the Tables. Accordingly, the term "higher
expression" is generally assigned to all polynucleotides, definable by numbers and
Affymetrix ID, the t-values and fold change (fc) values of which are positive.
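
The sign convention above can be illustrated with a short Python sketch. It is not the procedure of the Examples: the probe names, the expression values and the helper `supporting_markers` are hypothetical, and comparing a sample against a reference level of the comparison group is an assumption made for illustration only.

```python
def supporting_markers(table, sample, reference):
    """Return the markers whose expression in `sample` matches the sign of fc.

    table:     {affy_id: fc} taken from one Table (fc < 0 or fc > 0)
    sample:    {affy_id: expression level measured in the test sample}
    reference: {affy_id: expression level of the comparison group}
    """
    hits = []
    for affy_id, fc in table.items():
        if affy_id not in sample or affy_id not in reference:
            continue  # marker not measured, no evidence either way
        lower = sample[affy_id] < reference[affy_id]
        higher = sample[affy_id] > reference[affy_id]
        # negative fc with lower expression, or positive fc with higher expression
        if (fc < 0 and lower) or (fc > 0 and higher):
            hits.append(affy_id)
    return hits


# Hypothetical probe set IDs and values, for illustration only.
table = {"probe_A": -2.1, "probe_B": 1.8, "probe_C": 3.0}
sample = {"probe_A": 40.0, "probe_B": 310.0, "probe_C": 95.0}
reference = {"probe_A": 120.0, "probe_B": 150.0, "probe_C": 100.0}
print(supporting_markers(table, sample, reference))
```

In this made-up example, `probe_A` and `probe_B` support the subtype, whereas `probe_C` has a positive fc value but is not more highly expressed in the sample.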

According to the present invention, the term "expression" refers to the process by which
mRNA or a polypeptide is produced based on the nucleic acid sequence of a gene, i.e.
"expression" also includes the formation of mRNA upon transcription. In accordance with
the present invention, the term "determining the expression level" preferably refers to the
determination of the level of expression, namely of the markers.

Generally, "marker" refers to any genetically controlled difference which can be used in
the genetic analysis of a test versus a control sample, for the purpose of assigning the
sample to a defined genotype or phenotype. As used herein, "markers" refer to genes
which are differentially expressed in, e.g., different AML subtypes. The markers can be
defined by their gene symbol name, their encoded protein name, their transcript
identification number (cluster identification number), the data base accession number,
public accession number or GenBank identifier or, as done in the present invention,
Affymetrix identification number, chromosomal location, UniGene accession number and
cluster type, LocusLink accession number (see Examples and Tables).



The Affymetrix identification number (affy ID) is accessible for anyone and the person
skilled in the art by entering the "gene expression omnibus" internet page of the National
Center for Biotechnology Information (NCBI) (http://www.ncbi.nlm.nih.gov/geo/). In
particular, the affy IDs of the polynucleotides used for the method of the present invention
are derived from the so-called U133 chip. The sequence data of each identification number
can be viewed at http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GPL96



Generally, the expression level of a marker is determined by determining the
expression of its corresponding "polynucleotide" as described hereinafter.

According to the present invention, the term "polynucleotide" refers, generally, to a DNA,
in particular cDNA, or RNA, in particular a cRNA, or a portion thereof or a polypeptide or
a portion thereof. In the case of RNA (or cDNA), the polynucleotide is formed upon
transcription of a nucleotide sequence which is capable of expression. The polynucleotide
fragments refer to fragments preferably of between at least 8, such as 10, 12, 15 or 18
nucleotides and at least 50, such as 60, 80, 100, 200 or 300 nucleotides in length, or a
complementary sequence thereto, representing a consecutive stretch of nucleotides of a
gene, cDNA or mRNA. In other terms, polynucleotides include also any fragment (or
complementary sequence thereto) of a sequence derived from any of the markers defined
above as long as these fragments unambiguously identify the marker.
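
Whether a fragment unambiguously identifies a marker can be checked computationally. The following Python sketch assumes that the marker sequences are available as plain DNA strings; the sequences and the function names are hypothetical and only illustrate the idea that a fragment, or its complementary sequence, should occur in exactly one marker.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Complementary strand of a DNA sequence, read 5' to 3'."""
    return seq.translate(_COMPLEMENT)[::-1]


def markers_matched(fragment, markers):
    """Names of all markers containing `fragment` or its reverse complement."""
    fragment = fragment.upper()
    rc = reverse_complement(fragment)
    return [
        name for name, seq in markers.items()
        if fragment in seq.upper() or rc in seq.upper()
    ]


def identifies_unambiguously(fragment, markers, min_length=8):
    """True if the fragment is long enough and occurs in exactly one marker."""
    if len(fragment) < min_length:
        return False
    return len(markers_matched(fragment, markers)) == 1


# Made-up sequences, for illustration only.
markers = {
    "marker_1": "ATGGCCATTGTAATGGGCCGC",
    "marker_2": "ATGGCCATTCCCGGGAAATTT",
}
print(identifies_unambiguously("GTAATGGGCC", markers))  # found in marker_1 only
print(identifies_unambiguously("ATGGCCATT", markers))   # shared by both markers
```

The default `min_length` of 8 mirrors the lower bound of the fragment length named in the text; a real check would run against all transcripts that may be present in the sample, not only against the markers.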



The determination of the expression level may be effected at the transcriptional or
translational level, i.e. at the level of mRNA or at the protein level. Protein fragments such
as peptides or polypeptides advantageously comprise between at least 6 and at least 25,
such as 30, 40, 80, 100 or 200 consecutive amino acids representative of the corresponding
full length protein. Six amino acids are generally recognized as the lowest peptidic stretch
giving rise to a linear epitope recognized by an antibody, fragment or derivative thereof.
Alternatively, the proteins or fragments thereof may be analysed using nucleic acid
molecules specifically binding to three-dimensional structures (aptamers).

Depending on the nature of the polynucleotide or polypeptide, the determination of the
expression levels may be effected by a variety of methods. For determining and detecting
the expression level, it is preferred in the present invention that the polynucleotide, in
particular the cRNA, is labelled.

The labelling of the polynucleotide or a polypeptide can occur by a variety of methods
known to the skilled artisan. The label can be fluorescent, chemiluminescent,
bioluminescent or radioactive (such as a phosphorus radioisotope). The labelling compound can be any
labelling compound being suitable for the labelling of polynucleotides and/or polypeptides.
Examples include fluorescent dyes, such as fluorescein, dichlorofluorescein,
hexachlorofluorescein, BODIPY variants, ROX, tetramethylrhodamine, rhodamine X,
Cyanine-2, Cyanine-3, Cyanine-5, Cyanine-7, IRD40, FluorX, Oregon Green, Alexa
variants (available e.g. from Molecular Probes or Amersham Biosciences) and the like,
biotin or biotinylated nucleotides, digoxigenin, radioisotopes, antibodies, enzymes and
receptors. Depending on the type of labelling, the detection is done via fluorescence
measurements, conjugation to streptavidin and/or avidin, antigen-antibody- and/or
antibody-antibody-interactions, radioactivity measurements, as well as catalytic and/or
receptor/ligand interactions. Suitable methods include the direct labelling (incorporation)
method, the amino-modified (amino-allyl) nucleotide method (available e.g. from
Ambion), and the primer tagging method (DNA dendrimer labelling, as kit available e.g.
from Genisphere). Particularly preferred for the present invention is the use of biotin or
biotinylated nucleotides for labelling, with the latter being directly incorporated into e.g.
the cRNA polynucleotide by in vitro transcription.

If the polynucleotide is mRNA, cDNA may be prepared into which a detectable label, as
exemplified above, is incorporated. Said detectably labelled cDNA, in single-stranded
form, may then be hybridised, preferably under stringent or highly stringent conditions, to a
panel of single-stranded oligonucleotides representing different genes and affixed to a solid
support such as a chip. Upon applying appropriate washing steps, those cDNAs will be
detected or quantitatively detected that have a counterpart in the oligonucleotide panel.
Various advantageous embodiments of this general method are feasible. For example, the
mRNA or the cDNA may be amplified e.g. by polymerase chain reaction, wherein it is
preferable, for quantitative assessments, that the number of amplified copies corresponds
relative to further amplified mRNAs or cDNAs to the number of mRNAs originally
present in the cell. In a preferred embodiment of the present invention, the cDNAs are
transcribed into cRNAs prior to the hybridisation step wherein only in the transcription
step a label is incorporated into the nucleic acid and wherein the cRNA is employed for
hybridisation. Alternatively, the label may be attached subsequent to the transcription step.

Similarly, proteins from a cell or tissue under investigation may be contacted with a panel
of aptamers or of antibodies or fragments or derivatives thereof. The antibodies etc. may be
affixed to a solid support such as a chip. Binding of proteins indicative of an AML subtype
may be verified by binding to a detectably labelled secondary antibody or aptamer. For the
labelling of antibodies, reference is made to Harlow and Lane, "Antibodies, a laboratory
manual", CSH Press, 1988, Cold Spring Harbor. Specifically, a minimum set of proteins
necessary for diagnosis of all AML subtypes may be selected for creation of a protein array
system to make diagnosis on a protein lysate of a diagnostic bone marrow sample directly.
Protein array systems for the detection of specific protein expression profiles already are
available (for example: Bio-Plex, BIORAD, München, Germany). For this application
preferably antibodies against the proteins have to be produced and immobilized on a
platform e.g. glass slides or microtiter plates. The immobilized antibodies can be labelled
with a reactant specific for the certain target proteins as discussed above. The reactants can
include enzyme substrates, DNA, receptors, antigens or antibodies to create for example a
capture sandwich immunoassay.

For reliably distinguishing AML-specific FLT3 length mutations from TKD mutations, it is
useful that the expression of more than one of the above defined markers is determined. As
a criterion for the choice of markers, the statistical significance of markers as expressed in
q or p values based on the concept of the false discovery rate is determined. In doing so, a
measure of statistical significance called the q value is associated with each tested feature.
The q value is similar to the p value, except that it is a measure of significance in terms of
the false discovery rate rather than the false positive rate (Storey JD and Tibshirani R.
Proc.Natl.Acad.Sci., 2003, Vol. 100:9440-5).
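
As a rough illustration of ranking markers by false discovery rate, the following Python
sketch computes Benjamini-Hochberg adjusted p-values. This is a simpler stand-in and not
the q-value estimator of Storey and Tibshirani cited above (it corresponds to assuming that
all tested features could be null). The function name and the example p-values are
hypothetical and are not taken from Tables 1-2.

```python
def bh_adjust(p_values):
    """Return Benjamini-Hochberg adjusted p-values in the input order."""
    m = len(p_values)
    # Indices of the p-values sorted from smallest to largest.
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down, enforcing monotonicity.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, p_values[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


# Hypothetical p-values for four markers (illustrative only).
example = bh_adjust([0.01, 0.04, 0.03, 0.20])
```

Markers would then be ordered by the adjusted value, smallest first, and a cut-off such as
those named in the text applied to select the marker set.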

In a preferred embodiment of the present invention, markers as defined in Tables 1-2
having a q-value of less than 3E-02, less than 3E-06, more preferred less than 1.5E-09,
most preferred less than 1.5E-11, less than 1.5E-20, or less than 1.5E-30, are measured.

Of the above defined markers, the expression level of at least two, preferably of at least
ten, more preferably of at least 25, most preferably of 50 of the markers of at least one of
the Tables is determined.

In another preferred embodiment, the expression level of at least 2, of at least 5, of at least
10 out of the markers having the numbers 1-10, 1-20, 1-40, 1-50 of at least one of the
Tables is measured.

The level of the expression of the "marker", i.e. the expression of the polynucleotide, is
indicative of the AML subtype of a cell or an organism. The level of expression of a
marker or group of markers is measured and is compared with the level of expression of
the same marker or the same group of markers from other cells or samples. The
comparison may be effected in an actual experiment or in silico. When the expression level,
also referred to as expression pattern or expression signature (expression profile), is
measurably different, there is according to the invention a meaningful difference in the
level of expression. Preferably the difference is at least 5%, 10% or 20%, more preferred
at least 50% or may even be as high as 75% or 100%. More preferred the difference in the
level of expression is at least 200%, i.e. two fold, at least 500%, i.e. five fold, or at least
1000%, i.e. 10 fold.
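
The comparison of two expression levels can be sketched in Python as follows. The sketch
works with fold ratios only, since the text equates 200% with two fold; the function names
and the default threshold are assumptions chosen for illustration.

```python
def fold_change(level_a, level_b):
    """Ratio of the larger to the smaller expression level (always >= 1)."""
    if level_a <= 0 or level_b <= 0:
        raise ValueError("expression levels must be positive")
    return max(level_a, level_b) / min(level_a, level_b)


def is_meaningful(level_a, level_b, min_fold=2.0):
    """True if the two levels differ by at least min_fold (assumed threshold)."""
    return fold_change(level_a, level_b) >= min_fold
```

For example, `is_meaningful(200.0, 100.0)` compares a two-fold difference against the
default two-fold threshold, and `min_fold=10.0` would apply the stricter 10-fold criterion.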

Accordingly, the expression level of markers expressed lower in a first subtype than in at
least one second subtype, which differs from the first subtype, is at least 5%, 10% or 20%,
more preferred at least 50% or may even be 75% or 100%, i.e. 2-fold lower, preferably at
least 10-fold, more preferably at least 50-fold, and most preferably at least 100-fold lower
in the first subtype. On the other hand, the expression level of markers expressed higher in
a first subtype than in at least one second subtype, which differs from the first subtype, is
at least 5%, 10% or 20%, more preferred at least 50% or may even be 75% or 100%, i.e.
2-fold higher, preferably at least 10-fold, more preferably at least 50-fold, and most
preferably at least 100-fold higher in the first subtype.

In another embodiment of the present invention, the sample is derived from an individual
having leukemia, preferably AML.

For the method of the present invention it is preferred if the polynucleotide the expression
level of which is determined is in the form of a transcribed polynucleotide. A particularly
preferred transcribed polynucleotide is an mRNA, a cDNA and/or a cRNA, with the latter
being preferred. Transcribed polynucleotides are isolated from a sample, reverse
transcribed and/or amplified, and labelled by employing methods well known to the person
skilled in the art (see Example 3). In a preferred embodiment of the methods according to
the invention, the step of determining the expression profile further comprises amplifying
the transcribed polynucleotide.

In order to determine the expression level of the transcribed polynucleotide by the method
of the present invention, it is preferred that the method comprises hybridizing the
transcribed polynucleotide to a complementary polynucleotide, or a portion thereof, under
stringent hybridization conditions, as described hereinafter.

The term "hybridizing" means hybridization under conventional hybridization conditions,
preferably under stringent conditions as described, for example, in Sambrook, J., et al., in
"Molecular Cloning: A Laboratory Manual" (1989), Eds. J. Sambrook, E. F. Fritsch and T.
Maniatis, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY and the further
definitions provided above. Such conditions are, for example, hybridization in 6x SSC, pH
7.0 / 0.1% SDS at about 45°C for 18-23 hours, followed by a washing step with 2x
SSC/0.1% SDS at 50°C. In order to select the stringency, the salt concentration in the
washing step can, for example, be chosen between 2x SSC/0.1% SDS at room temperature
for low stringency and 0.2x SSC/0.1% SDS at 50°C for high stringency. In addition, the
temperature of the washing step can be varied between room temperature, ca. 22°C, for
low stringency, and 65°C to 70°C for high stringency. Also contemplated are
polynucleotides that hybridize at lower stringency hybridization conditions. Changes in the
stringency of hybridization and signal detection are primarily accomplished through the
manipulation, preferably, of formamide concentration (lower percentages of formamide
result in lowered stringency), salt conditions, or temperature. For example, lower
stringency conditions include an overnight incubation at 37°C in a solution comprising 6x
SSPE (20x SSPE = 3M NaCl; 0.2M NaH2PO4; 0.02M EDTA, pH 7.4), 0.5% SDS, 30%
formamide, 100 mg/ml salmon sperm blocking DNA, followed by washes at 50°C with 1x
SSPE, 0.1% SDS. In addition, to achieve even lower stringency, washes performed
following stringent hybridization can be done at higher salt concentrations (e.g. 5x SSC).
Variations in the above conditions may be accomplished through the inclusion and/or
substitution of alternate blocking reagents used to suppress background in hybridization
experiments. The inclusion of specific blocking reagents may require modification of the
hybridization conditions described above, due to problems with compatibility.

"Complementary" and "complementarity", respectively, can be described by the
percentage, i.e. proportion, of nucleotides which can form base pairs between two
polynucleotide strands or within a specific region or domain of the two strands. Generally,
complementary nucleotides are, according to the base pairing rules, adenine and thymine
(or adenine and uracil), and cytosine and guanine. Complementarity may be partial, in
which only some of the nucleic acids' bases are matched according to the base pairing
rules. Or, there may be a complete or total complementarity between the nucleic acids. The
degree of complementarity between nucleic acid strands has effects on the efficiency and
strength of hybridization between nucleic acid strands.

Two nucleic acid strands are considered to be 100% complementary to each other over a
defined length if in a defined region all adenines of a first strand can pair with a thymine
(or a uracil) of a second strand, all guanines of a first strand can pair with a cytosine of a
second strand, all thymines (or uracils) of a first strand can pair with an adenine of a second
strand, and all cytosines of a first strand can pair with a guanine of a second strand, and
vice versa. According to the present invention, the degree of complementarity is
determined over a stretch of 20, preferably 25, nucleotides, i.e. a 60% complementarity
means that within a region of 20 nucleotides of two nucleic acid strands 12 nucleotides of
the first strand can base pair with 12 nucleotides of the second strand according to the
above ruling, either as a stretch of 12 contiguous nucleotides or interspersed by non-pairing
nucleotides, when the two strands are attached to each other over said region of 20
nucleotides. The degree of complementarity can range from at least about 50% to full, i.e.
100%, complementarity. Two single nucleic acid strands are said to be "substantially
complementary" when they are at least about 80% complementary, preferably about 90%
or higher. For carrying out the method of the present invention, substantial
complementarity is preferred.
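
The degree of complementarity defined above can be computed with a short Python sketch.
It assumes the two strands are already aligned position by position (the first strand read
5' to 3', the second given 3' to 5') and are of equal length; the names `PAIRS` and
`percent_complementarity` are illustrative only.

```python
# Watson-Crick partners; uracil pairs with adenine just as thymine does.
PAIRS = {("A", "T"), ("T", "A"), ("A", "U"), ("U", "A"), ("G", "C"), ("C", "G")}


def percent_complementarity(strand_a, strand_b):
    """Percentage of aligned positions that can form a base pair.

    Position i of strand_a is assumed to face position i of strand_b.
    """
    if len(strand_a) != len(strand_b) or not strand_a:
        raise ValueError("strands must be non-empty and of equal length")
    paired = sum((a, b) in PAIRS for a, b in zip(strand_a.upper(), strand_b.upper()))
    return 100.0 * paired / len(strand_a)


# A 20-nucleotide region in which the first 12 positions pair and the last 8 do not.
first = "ACGTACGTACGTACGTACGT"
second = "TGCATGCATGCA" + "ACGTACGT"
degree = percent_complementarity(first, second)
```

With 12 of 20 positions paired, this region corresponds to the 60% case described in the
text; a value of 80 or more would meet the "substantially complementary" threshold.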

Preferred methods for detection and quantification of the amount of polynucleotides, i.e.
for the methods according to the invention allowing the determination of the level of
expression of a marker, are those described by Sambrook et al. (1989) or real time methods
known in the art such as the TaqMan® method disclosed in WO92/02638 and the
corresponding U.S. 5,210,015, U.S. 5,804,375 and U.S. 5,487,972. This method exploits the
exonuclease activity of a polymerase to generate a signal. In detail, the (at least one) target
nucleic acid component is detected by a process comprising contacting the sample with an
oligonucleotide containing a sequence complementary to a region of the target nucleic acid
component and a labeled oligonucleotide containing a sequence complementary to a
second region of the same target nucleic acid component sequence strand, but not
including the nucleic acid sequence defined by the first oligonucleotide, to create a mixture
of duplexes during hybridization conditions, wherein the duplexes comprise the target
nucleic acid annealed to the first oligonucleotide and to the labeled oligonucleotide such
that the 3'-end of the first oligonucleotide is adjacent to the 5'-end of the labeled
oligonucleotide. Then this mixture is treated with a template-dependent nucleic acid
polymerase having a 5' to 3' nuclease activity under conditions sufficient to permit the 5'
to 3' nuclease activity of the polymerase to cleave the annealed, labeled oligonucleotide
and release labeled fragments. The signal generated by the hydrolysis of the labeled
oligonucleotide is detected and/or measured. TaqMan® technology eliminates the need for
a solid phase bound reaction complex to be formed and made detectable. Other methods
include e.g. fluorescence resonance energy transfer between two adjacently hybridized
probes as used in the LightCycler® format described in U.S. 6,174,670.

A preferred protocol, if the marker, i.e. the polynucleotide, is in the form of a transcribed
nucleotide, is described in Example 3, where total RNA is isolated, cDNA and,
subsequently, cRNA is synthesized and biotin is incorporated during the transcription
reaction. The purified cRNA is applied to commercially available arrays which can be
obtained e.g. from Affymetrix. The hybridized cRNA is detected according to the methods
described in Example 3. The arrays are produced by photolithography or other methods
known to experts skilled in the art, e.g. from U.S. 5,445,934, U.S. 5,744,305, U.S.
5,700,637, U.S. 5,945,334 and EP 0 619 321 or EP 0 373 203, or as described hereinafter in
greater detail.

In another embodiment of the present invention, the polynucleotide or at least one of the
polynucleotides is in the form of a polypeptide. In another preferred embodiment, the
expression level of the polynucleotides or polypeptides is detected using a compound
which specifically binds to the polynucleotide or the polypeptide of the present invention.

As used herein, "specifically binding" means that the compound is capable of
discriminating between two or more polynucleotides or polypeptides, i.e. it binds to the
desired polynucleotide or polypeptide, but essentially does not bind unspecifically to a
different polynucleotide or polypeptide.

The compound can be an antibody, or a fragment thereof, an enzyme, a so-called small
molecule compound, or a protein scaffold, preferably an anticalin. In a preferred
embodiment, the compound specifically binding to the polynucleotide or polypeptide is an
antibody, or a fragment thereof.

As used herein, an "antibody" comprises monoclonal antibodies as first described by
Köhler and Milstein in Nature 256 (1975), 495-497 as well as polyclonal antibodies, i.e.
antibodies contained in a polyclonal antiserum. Monoclonal antibodies include those
produced by transgenic mice. Fragments of antibodies include F(ab')2, Fab and Fv
fragments. Derivatives of antibodies include scFvs, chimeric and humanized antibodies.
See, for example, Harlow and Lane, loc. cit. For the detection of polypeptides using
antibodies or fragments thereof, the person skilled in the art is aware of a variety of
methods, all of which are included in the present invention. Examples include
immunoprecipitation, Western blotting, enzyme-linked immunosorbent assay (ELISA),
radioimmunoassay (RIA), dissociation-enhanced lanthanide fluoroimmunoassay
(DELFIA), and scintillation proximity assay (SPA). For detection, it is desirable
if the antibody is labelled by one of the labelling compounds and methods described supra.

In another preferred embodiment of the present invention, the method for distinguishing
AML-specific FLT3 length mutations from TKD mutations is carried out on an array.

In general, an "array" or "microarray" refers to a linear or two- or three-dimensional
arrangement of preferably discrete nucleic acid or polypeptide probes which comprises an
intentionally created collection of nucleic acid or polypeptide probes of any length spotted
onto a substrate/solid support. The person skilled in the art knows a collection of nucleic
acids or polypeptides spotted onto a substrate/solid support also under the term "array". As
known to the person skilled in the art, a microarray usually refers to a miniaturised array
arrangement, with the probes being attached at a density of at least about 10, 20, 50 or 100
nucleic acid molecules referring to different or the same genes per cm². Furthermore,
where appropriate an array can be referred to as a "gene chip". The array itself can have
different formats, e.g. libraries of soluble probes or libraries of probes tethered to resin
beads, silica chips, or other solid supports.

The process of array fabrication is well known to the person skilled in the art. In the
following, the process for preparing a nucleic acid array is described. Commonly, the
process comprises preparing a glass (or other) slide (e.g. chemical treatment of the glass to
enhance binding of the [...]
representing genes of a genome of interest, and spotting these sequences of
interest onto the glass slide. Sequences of interest can be obtained via creating a cDNA library
from an mRNA source or by using publicly available databases, such as GenBank, to
annotate the sequence information of custom cDNA libraries or to identify cDNA clones
from previously prepared libraries. Generally, it is recommendable to amplify obtained
sequences by PCR in order to have sufficient amounts of DNA to print on the array. The
liquid containing the amplified probes can be deposited on the array by using a set of
microspotting pins. Ideally, the amount deposited should be uniform. The process can
further include UV-crosslinking in order to enhance immobilization of the probes on the
array.

In a preferred embodiment, the array is a high density oligonucleotide (oligo) array using a
light-directed chemical synthesis process, employing the so-called photolithography
technology. Unlike common cDNA arrays, oligo arrays (according to the Affymetrix
technology) use a single-dye technology. Given the sequence information of the markers,
the sequence can be synthesized directly onto the array, thus bypassing the need for
physical intermediates, such as PCR products, required for making cDNA arrays. For this
purpose, the marker, or partial sequences thereof, can be represented by 14 to 20 features,
preferably by less than 14 features, more preferably less than 10 features, even more
preferably by 6 features or less, with each feature being a short sequence of nucleotides
(oligonucleotide), which is a perfect match (PM) to a segment of the respective gene. The
PM oligonucleotides are paired with mismatch (MM) oligonucleotides which have a single
mismatch at the central base of the oligonucleotide and are used as "controls". The chip
exposure sites are defined by masks and are deprotected by the use of light, followed by a
chemical coupling step resulting in the synthesis of one nucleotide. The masking, light
deprotection, and coupling process can then be repeated to synthesize the next nucleotide,
until the nucleotide chain is of the specified length.
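
The relationship between a PM oligonucleotide and its MM control can be sketched in
Python. The text only states that the mismatch sits at the central base; replacing that base
with its complement is an assumption made here for illustration, as are the function name
and the example sequence.

```python
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def mismatch_probe(perfect_match):
    """Return the MM probe: the PM sequence with its central base complemented."""
    center = len(perfect_match) // 2
    swapped = COMPLEMENT[perfect_match[center]]
    return perfect_match[:center] + swapped + perfect_match[center + 1:]


# Hypothetical 25-mer; its central base is at index 12 (the 13th position).
pm = "ACGTACGTACGTACGTACGTACGTA"
mm = mismatch_probe(pm)
```

The PM and MM sequences differ at exactly one position, so the MM signal can serve as an
estimate of non-specific hybridization for the paired PM feature.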

Advantageously, the method of the present invention is carried out in a robotics system
including robotic plating and a robotic liquid transfer system, e.g. using microfluidics, i.e.
channelled structures.

A particularly preferred method according to the present invention is as follows:

1. Obtaining a sample, e.g. bone marrow or peripheral blood aliquots, from a patient
having AML

2. Extracting RNA, preferably mRNA, from the sample

3. Reverse transcribing the RNA into cDNA

4. In vitro transcribing the cDNA into cRNA

5. Fragmenting the cRNA

6. Hybridizing the fragmented cRNA on standard microarrays

7. Determining hybridization



In another embodiment, the present invention is directed to the use of at least one marker
selected from the markers identifiable by their Affymetrix Identification Numbers (affy
ID) as defined in Tables 1, and/or 2 for the manufacturing of a diagnostic for
distinguishing AML-specific FLT3 length mutations from TKD mutations. The use of the
present invention is particularly advantageous for distinguishing AML-specific FLT3
length mutations from TKD mutations in an individual having AML. The use of said
markers for distinguishing AML-specific FLT3 length mutations from TKD mutations,
preferably based on microarray technology, offers the following advantages: (1) more
rapid and more precise diagnosis, (2) easy to use in laboratories without specialized
experience, (3) abolishes the requirement for analyzing viable cells for chromosome
analysis (transport problem), and (4) very experienced hematologists for cytomorphology
and cytochemistry, immunophenotyping as well as cytogeneticists and molecular biologists
are no longer required.

Accordingly, the present invention refers to a diagnostic kit containing at least one marker
selected from the markers identifiable by their Affymetrix Identification Numbers (affy
ID) as defined in Tables 1, and/or 2 for distinguishing AML-specific FLT3 length
mutations from TKD mutations, in combination with suitable auxiliaries. Suitable
auxiliaries, as used herein, include buffers, enzymes, labelling compounds, and the like. In
a preferred embodiment, the marker contained in the kit is a nucleic acid molecule which is
capable of hybridizing to the mRNA corresponding to at least one marker of the present
invention. Preferably, the at least one nucleic acid molecule is attached to a solid support,
e.g. a polystyrene microtiter dish, nitrocellulose membrane, glass surface or to
non-immobilized particles in solution.

In another preferred embodiment, the diagnostic kit contains at least one reference for an
AML-specific FLT3 length mutation and/or TKD mutation. As used herein, the reference
can be a sample or a data bank.



In another embodiment, the present invention is directed to an apparatus for distinguishing
AML-specific FLT3 length mutations from TKD mutations in a sample, containing a
reference data bank obtainable by a method comprising

(a) compiling a gene expression profile of a patient sample by determining the
expression level of at least one marker selected from the markers identifiable by
their Affymetrix Identification Numbers (affy ID) as defined in Tables 1, and/or
2, and

(b) classifying the gene expression profile by means of a machine learning
algorithm.

According to the present invention, the "machine learning algorithm" is a computation-based
prediction methodology, also known to the person skilled in the art as a "classifier",
employed for characterizing a gene expression profile. The signals corresponding to a
certain expression level which are obtained by the microarray hybridization are subjected
to the algorithm in order to classify the expression profile. Supervised learning involves
"training" a classifier to recognize the distinctions among classes and then "testing" the
accuracy of the classifier on an independent test set. For new, unknown samples the
classifier shall predict into which class the sample belongs.

Preferably, the machine learning algorithm is selected from the group consisting of
Weighted Voting, K-Nearest Neighbors, Decision Tree Induction, Support Vector
Machines (SVM), and Feed-Forward Neural Networks. Most preferably, the machine
learning algorithm is a Support Vector Machine, such as polynomial kernel and Gaussian
Radial Basis Function-kernel SVM models.
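
To make the idea of a classifier concrete, the following Python sketch implements
K-Nearest Neighbors, one of the algorithms listed above. It is chosen here only because it
fits in a few lines of standard-library code; it is not the preferred SVM approach, and the
two-marker expression profiles and class labels are invented for illustration.

```python
import math
from collections import Counter


def knn_predict(train_profiles, train_labels, query, k=3):
    """Predict the class of `query` by majority vote of its k nearest profiles."""
    # Euclidean distance from the query to every training profile, nearest first.
    distances = sorted(
        (math.dist(profile, query), label)
        for profile, label in zip(train_profiles, train_labels)
    )
    votes = Counter(label for _, label in distances[:k])
    return votes.most_common(1)[0][0]


# Hypothetical expression levels of two markers in six training samples.
profiles = [(1.0, 5.0), (1.2, 4.8), (0.9, 5.1), (4.0, 1.0), (4.2, 0.9), (3.9, 1.2)]
labels = ["FLT3-LM", "FLT3-LM", "FLT3-LM", "TKD", "TKD", "TKD"]
predicted = knn_predict(profiles, labels, (1.1, 5.0))
```

A real application would use the marker sets of Tables 1-2 rather than two invented
markers, and the classifier would be evaluated on an independent test set as described.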

The classification accuracy of a given gene list for a set of microarray experiments is
preferably estimated using Support Vector Machines (SVM), because there is evidence that
SVM-based prediction slightly outperforms other classification techniques like k-Nearest
Neighbors (k-NN). The LIBSVM software package version 2.36 was used (SVM-type:
C-SVC, linear kernel (http://www.csie.ntu.edu.tw/~cjlin/libsvm/)). The skilled artisan is
furthermore referred to Brown et al., Proc.Natl.Acad.Sci., 2000; 97: 262-267, Furey et al.,
Bioinformatics. 2000; 16: 906-914, and Vapnik V. Statistical Learning Theory. New York:
Wiley, 1998.

In detail, the classification accuracy of a given gene list for a set of microarray experiments
can be estimated using Support Vector Machines (SVM) as a supervised learning technique.
Generally, SVMs are trained using differentially expressed genes which were identified on
a subset of the data, and then this trained model is employed to assign new samples to those
trained groups from a second and different data set. Differentially expressed genes were
identified applying ANOVA and t-test statistics (Welch t-test). Based on identified distinct
gene expression signatures, respective training sets consisting of 2/3 of cases and test sets
with 1/3 of cases to assess classification accuracies are designated. Assignment of cases to
training and test set is randomized and balanced by diagnosis. Based on the training set a
Support Vector Machine (SVM) model is built.
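
The randomized assignment of cases to training and test sets, balanced by diagnosis, can be
sketched in Python as follows. The function name, the fixed random seed and the rounding
rule for group sizes are assumptions made for this illustration; the text does not specify
how the split was implemented.

```python
import random
from collections import defaultdict


def stratified_split(labels, train_fraction=2 / 3, seed=0):
    """Return (train_indices, test_indices), balanced by diagnosis label."""
    rng = random.Random(seed)
    by_label = defaultdict(list)
    for index, label in enumerate(labels):
        by_label[label].append(index)
    train, test = [], []
    # Shuffle within each diagnosis so both sets keep the class proportions.
    for indices in by_label.values():
        rng.shuffle(indices)
        cut = round(len(indices) * train_fraction)
        train.extend(indices[:cut])
        test.extend(indices[cut:])
    return sorted(train), sorted(test)


# Hypothetical cohort: 30 cases of one diagnosis and 9 of another.
diagnoses = ["FLT3-LM"] * 30 + ["TKD"] * 9
train_idx, test_idx = stratified_split(diagnoses)
```

Splitting within each diagnosis keeps rare classes represented in both sets, which a plain
random split of the whole cohort would not guarantee.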


According to the present invention, the apparent accuracy, i.e. the overall rate of correct
predictions of the complete data set, was estimated by 10-fold cross validation. This means
that the data set was divided into 10 approximately equally sized subsets, an SVM model
was trained for 9 subsets and predictions were generated for the remaining subset. This
training and prediction process was repeated 10 times to include predictions for each
subset. Subsequently the data set was split into a training set, consisting of two thirds of the
samples, and a test set with the remaining one third. Apparent accuracy for the training set
was estimated by 10-fold cross validation (analogous to apparent accuracy for the complete
set). An SVM model of the training set was built to predict diagnosis in the independent test
set, thereby estimating true accuracy of the prediction model. This prediction approach was
applied both for overall classification (multi-class) and binary classification (diagnosis X
=> yes or no). For the latter, sensitivity and specificity were calculated:

Sensitivity = (number of positive samples predicted)/(number of true positives)

Specificity = (number of negative samples predicted)/(number of true negatives)
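
A minimal Python sketch of these two measures follows. It reads the formulas as "correctly
predicted positives divided by all truly positive samples" and "correctly predicted
negatives divided by all truly negative samples"; that reading, the function name and the
example labels are assumptions for illustration.

```python
def sensitivity_specificity(true_labels, predicted_labels, positive):
    """Return (sensitivity, specificity) for one diagnosis treated as positive."""
    tp = fn = tn = fp = 0
    for truth, predicted in zip(true_labels, predicted_labels):
        if truth == positive:
            if predicted == positive:
                tp += 1  # positive sample correctly predicted
            else:
                fn += 1  # positive sample missed
        else:
            if predicted == positive:
                fp += 1  # negative sample wrongly called positive
            else:
                tn += 1  # negative sample correctly predicted
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("need at least one positive and one negative sample")
    return tp / (tp + fn), tn / (tn + fp)


# Hypothetical binary task: "TKD" yes or no, six test samples.
truth = ["TKD", "TKD", "LM", "LM", "LM", "LM"]
calls = ["TKD", "LM", "LM", "LM", "LM", "TKD"]
sens, spec = sensitivity_specificity(truth, calls, positive="TKD")
```

In this invented example one of two TKD samples is found and three of four other samples
are correctly rejected.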

In a preferred embodiment, the reference data bank is backed up on a computational data
memory chip which can be inserted in as well as removed from the apparatus of the present
invention, e.g. like an interchangeable module, in order to use another data memory chip
containing a different reference data bank.

The apparatus of the present invention containing a desired reference data bank can be
used in a way such that an unknown sample is, first, subjected to gene expression profiling,
e.g. by microarray analysis in a manner as described supra or in the art, and the expression
level data obtained by the analysis are, second, fed into the apparatus and compared with
the data of the reference data bank obtainable by the above method. For this purpose, the
apparatus suitably contains a device for entering the expression level data, for
example a control panel such as a keyboard. The results, i.e. whether and how the data of the
unknown sample fit into the reference data bank, can be made visible on a provided
monitor or display screen and, if desired, printed out on an incorporated or connected
printer.

Alternatively, the apparatus of the present invention is equipped with particular appliances
suitable for detecting and measuring the expression profile data and, subsequently,
proceeding with the comparison with the reference data bank. In this embodiment, the
apparatus of the present invention can contain a gripper arm and/or a tray which takes up
the microarray containing the hybridized nucleic acids.

In another embodiment, the present invention refers to a reference data bank for
distinguishing AML-specific FLT3 length mutations from TKD mutations in a sample,
obtainable by a method comprising

(a) compiling a gene expression profile of a patient sample by determining the
expression level of at least one marker selected from the markers identifiable by
their Affymetrix Identification Numbers (affy ID) as defined in Tables 1, and/or
2, and

(b) classifying the gene expression profile by means of a machine learning
algorithm.

Preferably, the reference data bank is backed up and/or contained in a computational
memory data chip.

The invention is further illustrated in the following tables and examples, without limiting
the scope of the invention:

TABLES 1.1-2.28

Tables 1-2 show AML subtype analysis of AML-specific FLT3 length mutations from
TKD mutations. The analysed markers are ordered according to their q-values, beginning
with the lowest q-values.

For convenience and a better understanding, Tables 1.1 to 2.28 are accompanied by explanatory tables
(Table 1.1A to 2.28A) where the numbering and the Affymetrix ID are further defined by other parameters,
e.g. gene bank accession number.

EXAMPLES 

Example 1: General experimental design of the invention and results

Different subtypes of acute myeloid leukemia (AML) can clearly be distinguished by
morphology, cytogenetics, and molecular genetics. Mutations within the FLT3 gene are
the most common genetic alterations in AML. Length mutations in the juxtamembrane
domain of FLT3 (FLT3-LM) were detected in 460/2134 (21.6%) and mutations in the
tyrosine kinase domain (TKD mutations) in 98/1711 (5.7%) of newly diagnosed AML in
our series. Both kinds of FLT3 mutations can be found most frequently in AML with
normal karyotype (AML-NK). While for TKD mutations a prognostic significance has not
yet been shown, the FLT3-LM defines a prognostically unfavorable subset of AML-NK.
Some of these patients have loss of the wildtype (WT) FLT3 allele in addition to the
FLT3-LM. These patients were shown to have an even worse outcome than FLT3-LM+
patients that retain the WT allele. Here we addressed the questions 1) whether patients with
FLT3 mutations can be distinguished from those without FLT3 mutations within AML-NK
and 2) whether different types of FLT3 mutations can be identified by distinct gene
expression signatures. Therefore, 148 cases with AML-NK were analyzed by U133 set
microarrays (Affymetrix). For each patient the FLT3-LM status was assessed by GeneScan
analysis (Applied Biosystems) and the TKD mutation status by melting curve analysis and
sequencing. The total cohort was subdivided into 8 groups: 1) AML-NK and no FLT3
mutation (n^3), 2) status 1: FLT3-LM/WT ratio <0.3 (n=12), 3) status 2: ratio 0.3-1.1
(n=19), 4) status 3: ratio of >1.2 = partial loss of WT (n=30), 5) status 4: total loss of WT
(n=5), 6) status 5: two or more low status mutations (n=5), 7) TKD mutation (n=10),
8) TKD + LM (n=3). Microarray data was analyzed by pattern recognition algorithms
(Principal Component Analysis (PCA) and hierarchical clustering), as well as Support
Vector Machines (SVM) for estimation of classification accuracies. Therefore, all samples
were divided into a training set consisting of 2/3 of cases to build an SVM model and a test
set with the remaining 1/3 of cases. Differentially expressed genes were selected according
to ANOVA and t-test statistics in the training set. A specific expression pattern was
assessed for each of the defined subgroups. Using pairwise comparisons, the TKD mutations
can clearly be distinguished from the FLT3-LM. In addition, FLT3-LM with loss of WT
reveal a specific expression pattern in comparison to low status FLT3-LM. By use of SVM
comparisons to AML-NK and all other mutation classes an accuracy of 100% was found
for status 4, 78% for status 3 (sensitivity 64%, specificity 84%), 74% for status 2
(sensitivity 17%, specificity 88%), 75% for status 1 (sensitivity 50%, specificity 79%),
88% for TKD mutations (sensitivity 50%, specificity 95%), but only 8% for status 5
mutations. In conclusion, a high percentage of cases of the different FLT3 mutations can be
exactly assigned. Only the status 5 mutations cannot be defined by a specific expression
profile. Besides their clinical differences, we could show that patients with FLT3-LM with
different quantitative status of the LM as well as the TKD mutations are also different with
regard to their expression pattern. This supports the hypothesis i) that FLT3-LM and TKD
mutations act through different downstream target genes and play different functional roles
in leukemogenesis; ii) that FLT3-LM cases with loss of the WT allele should be regarded
differently, i.e. with regard to prognosis and therapeutical interventions.


Example 2: General materials, methods and definitions of functional annotations 

The methods section contains both information on statistical analyses used for
identification of differentially expressed genes and detailed annotation data of identified
microarray probesets.

Affymetrix Probeset Annotation

All annotation data of GeneChip® arrays are extracted from the NetAffx™ Analysis
Center (internet website: www.affymetrix.com). Files for U133 set arrays, including
U133A and U133B microarrays, are derived from the June 2003 release. The original
publication refers to: Liu G, Loraine AE, Shigeta R, Cline M, Cheng J, Valmeekam V, Sun
S, Kulp D, Siani-Rose MA. NetAffx: Affymetrix probesets and annotations. Nucleic Acids
Res. 2003;31(1):82-6.

The sequence data are omitted due to their large size, and because they do not change,
whereas the annotation data are updated periodically, for example with new information on
chromosomal location and functional annotation of the respective gene products. Sequence
data are available for download in the NetAffx Download Center (www.affymetrix.com).

Data fields: 

In the following section, the content of each field of the data files is described.
Microarray probesets, for example those found to be differentially expressed between different
types of leukemia samples, are further described by additional information. The fields are
of the following types:

1. GeneChip Array Information 

2. Probe Design Information

3. Public Domain and Genomic References 

1. GeneChip Array Information
HG-U133 ProbeSet_ID: 

HG-U133 ProbeSet_ID describes the probe set identifier. Examples are: 200007_at,
200011_s_at, 200012_x_at.

GeneChip: 

The description of the GeneChip probe array name where the respective probeset is 
represented. Examples are: Affymetrix Human Genome U133A Array or Affymetrix 
Human Genome U133B Array. 

2. Probe Design Information

Sequence Type: 

The Sequence Type indicates whether the sequence is an Exemplar, Consensus or Control
sequence. An Exemplar is a single nucleotide sequence taken directly from a public
database. This sequence could be an mRNA or EST. A Consensus sequence is a
nucleotide sequence assembled by Affymetrix, based on one or more sequences taken from
a public database.

Transcript ID: 

The cluster identification number with a sub-cluster identifier appended. 



Sequence Derived From:

The accession number of the single sequence, or representative sequence, on which the
probe set is based. Refer to the "Sequence Source" field to determine the database used.

Sequence ID: 

For Exemplar sequences: Public accession number or GenBank identifier. For Consensus
sequences: Affymetrix identification number or public accession number.

Sequence Source: 

The database from which the sequence used to design this probe set was taken. Examples
are: GenBank®, RefSeq, UniGene, TIGR (annotations from The Institute for Genomic 
Research). 

3. Public Domain and Genomic References

Most of the data in this section come from LocusLink and UniGene databases, and are
annotations of the reference sequence on which the probe set is modeled.

Gene Symbol and Title: 

A gene symbol and a short title, when one is available. Such symbols are assigned by
different organizations for different species. Affymetrix annotational data come from the
UniGene record. There is no indication which species-specific databank was used, but
some of the possibilities include, for example, HUGO: The Human Genome Organization.

MapLocation:

The map location describes the chromosomal location when one is available. 

Unigene_Accession: 

UniGene accession number and cluster type. Cluster type can be "full length" or "est", or
"---" if unknown.

LocusLink: 

This information represents the LocusLink accession number. 

Full Length Ref. Sequences:

Indicates the references to multiple sequences in RefSeq. The field contains the ID and
description for each entry, and there can be multiple entries per probe set.
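
The data fields described above can be pictured as one record per probe set. The following Python sketch is one possible representation, given only for illustration. The class name, attribute names and the check on the sequence type are assumptions and do not reflect the actual NetAffx file layout. The probe set identifier and array name are the examples given in the text, and the sequence type in the example is an arbitrary placeholder.

```python
from dataclasses import dataclass, field
from typing import List, Optional

# Sequence types named in the text.
SEQUENCE_TYPES = {"Exemplar", "Consensus", "Control"}


@dataclass
class ProbesetAnnotation:
    # 1. GeneChip Array Information
    probeset_id: str                  # e.g. "200007_at"
    genechip: str                     # e.g. "Affymetrix Human Genome U133A Array"
    # 2. Probe Design Information
    sequence_type: str                # Exemplar, Consensus or Control
    transcript_id: Optional[str] = None
    sequence_derived_from: Optional[str] = None
    sequence_id: Optional[str] = None
    sequence_source: Optional[str] = None   # e.g. GenBank, RefSeq, UniGene, TIGR
    # 3. Public Domain and Genomic References
    gene_symbol: Optional[str] = None
    gene_title: Optional[str] = None
    map_location: Optional[str] = None
    unigene_accession: Optional[str] = None
    locuslink: Optional[str] = None
    full_length_ref_sequences: List[str] = field(default_factory=list)

    def __post_init__(self) -> None:
        # Reject values outside the three sequence types named in the text.
        if self.sequence_type not in SEQUENCE_TYPES:
            raise ValueError(f"unknown sequence type: {self.sequence_type!r}")


# Placeholder record; the sequence type is chosen arbitrarily for illustration.
record = ProbesetAnnotation(
    probeset_id="200007_at",
    genechip="Affymetrix Human Genome U133A Array",
    sequence_type="Consensus",
)
print(record.probeset_id, record.genechip)
```

Fields that are not available for a given probe set stay empty (None or an empty list) in this sketch.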
Example 3: Sample preparation, processing and data analysis
Method 1:

Microarray analyses were performed utilizing the GeneChip® System (Affymetrix, Santa
Clara, USA). Hybridization target preparations were performed according to recommended
protocols (Affymetrix Technical Manual). In detail, at time of diagnosis, mononuclear cells
were purified by Ficoll-Hypaque density centrifugation. They had been lysed immediately
in RLT buffer (Qiagen, Hilden, Germany), frozen, and stored at -80°C from 1 week to 38
months. For gene expression profiling, cell lysates of the leukemia samples were thawed,
homogenized (QIAshredder, Qiagen), and total RNA was extracted (RNeasy Mini Kit,
Qiagen). Subsequently, 5-10 µg total RNA isolated from 1 x 10^7 cells was used as starting
material for cDNA synthesis with oligo[(dT)24T7promoter]65 primer (cDNA Synthesis
System, Roche Applied Science, Mannheim, Germany). cDNA products were purified by
phenol/chloroform/IAA extraction (Ambion, Austin, USA) and acetate/ethanol-precipitated
overnight. For detection of the hybridized target nucleic acid, biotin-labeled
ribonucleotides were incorporated during the following in vitro transcription reaction
(Enzo BioArray HighYield RNA Transcript Labeling Kit, Enzo Diagnostics). After
quantification by spectrophotometric measurements and 260/280 absorbance values
assessment for quality control of the purified cRNA (RNeasy Mini Kit, Qiagen), 15 µg
cRNA was fragmented by alkaline treatment (200 mM Tris-acetate, pH 8.2/500 mM
potassium acetate/150 mM magnesium acetate) and added to the hybridization cocktail
sufficient for five hybridizations on standard GeneChip microarrays (300 µl final volume).
Washing and staining of the probe arrays was performed according to the recommended
Fluidics Station protocol (EukGE-WS2v4). Affymetrix Microarray Suite software (version
5.0.1) extracted fluorescence signal intensities from each feature on the microarrays as
detected by confocal laser scanning according to the manufacturer's recommendations.

Expression analysis quality assessment parameters included visual array inspection of the
scanned image for the presence of image artifacts and correct grid alignment for the
identification of distinct probe cells as well as both low 3'/5' ratio of housekeeping
controls (mean: 1.90 for GAPDH) and high percentage of detection calls (mean: 46.3%
present called genes). The 3' to 5' ratio of GAPDH probesets can be used to assess RNA
sample and assay quality. Signal values of the 3' probe sets for GAPDH are compared to
the Signal values of the corresponding 5' probe set. The ratio of the 3' probe set to the 5'
probe set is generally no more than 3.0. A high 3' to 5' ratio may indicate degraded RNA
or inefficient synthesis of ds cDNA or biotinylated cRNA (GeneChip® Expression
Analysis Technical Manual, www.affymetrix.com). Detection calls are used to determine
whether the transcript of a gene is detected (present) or undetected (absent) and were
calculated using default parameters of the Microarray Analysis Suite MAS 5.0 software
package.
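
The GAPDH 3' to 5' ratio check described above amounts to a simple division followed by a threshold comparison. The following Python sketch illustrates it under stated assumptions: the function names and the example signal values are hypothetical, and only the threshold of 3.0 comes from the text.

```python
MAX_3_TO_5_RATIO = 3.0  # upper limit stated in the text


def gapdh_3_to_5_ratio(signal_3p: float, signal_5p: float) -> float:
    """Ratio of the 3' probe set signal to the 5' probe set signal."""
    if signal_5p <= 0:
        raise ValueError("the 5' probe set signal must be positive")
    return signal_3p / signal_5p


def passes_rna_quality(signal_3p: float, signal_5p: float,
                       max_ratio: float = MAX_3_TO_5_RATIO) -> bool:
    """True if the 3' to 5' ratio is no more than the allowed maximum."""
    return gapdh_3_to_5_ratio(signal_3p, signal_5p) <= max_ratio


# Hypothetical signal values, chosen only to illustrate the check.
acceptable = passes_rna_quality(signal_3p=1900.0, signal_5p=1000.0)  # ratio 1.9
degraded = passes_rna_quality(signal_3p=4000.0, signal_5p=1000.0)    # ratio 4.0
print(acceptable, degraded)
```

A sample failing this check would point to degraded RNA or inefficient cDNA or cRNA synthesis, as noted above.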



Method 2: 

Bone marrow (BM) aspirates are taken at the time of the initial diagnostic biopsy and
remaining material is immediately lysed in RLT buffer (Qiagen), frozen and stored at -80°C
until preparation for gene expression analysis. For microarray analysis the GeneChip
System (Affymetrix, Santa Clara, CA, USA) is used. The targets for GeneChip analysis are
prepared according to the current Expression Analysis Technical Manual. Briefly, frozen
lysates of the leukemia samples are thawed, homogenized (QIAshredder, Qiagen) and total RNA
is extracted (RNeasy Mini Kit, Qiagen). Normally 10 µg total RNA isolated from 1 x 10^7
cells is used as starting material in the subsequent cDNA synthesis using oligo-dT-T7
promoter primer (cDNA Synthesis Kit, Roche Molecular Biochemicals). The cDNA is
purified by phenol-chloroform extraction and precipitated with 100% ethanol overnight.
For detection of the hybridized target nucleic acid, biotin-labeled ribonucleotides are
incorporated during the in vitro transcription reaction (Enzo® BioArray™ HighYield™
RNA Transcript Labeling Kit, ENZO). After quantification of the purified cRNA (RNeasy
Mini Kit, Qiagen), 15 µg are fragmented by alkaline treatment (200 mM Tris-acetate, pH
8.2, 500 mM potassium acetate, 150 mM magnesium acetate) and added to the
hybridization cocktail sufficient for 5 hybridizations on standard GeneChip microarrays.
Before expression profiling, Test3 Probe Arrays (Affymetrix) are chosen for monitoring of
the integrity of the cRNA. Only labeled cRNA cocktails which showed a ratio of the
measured intensity of the 3' to the 5' end of the GAPDH gene of less than 3.0 are selected for
subsequent hybridization on HG-U133 probe arrays (Affymetrix). Washing and staining
of the probe arrays is performed as described (see the original Affymetrix literature,
LOCKHART and LIPSHUTZ). The Affymetrix software (Microarray Suite, Version
4.0.1) extracted fluorescence intensities from each element on the arrays as detected by
confocal laser scanning according to the manufacturer's recommendations.
Claims

1. A method for distinguishing AML-specific FLT3 length mutations from TKD
mutations in a sample, the method comprising determining the expression level of
markers selected from the markers identifiable by their Affymetrix Identification
Numbers (affy ID) as defined in Tables 1, and/or 2,

wherein 

a lower expression of at least one polynucleotide defined by at least one of the 
numbers 1 to 50 of Table 1.1 having a negative fc value, and/or

a higher expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 1.1 having a positive fc value,

is indicative for the presence of AML_D835 when AML_D835 is distinguished
from all other subtypes,

and/or wherein 

a lower expression of at least one polynucleotide defined by at least one of the 
numbers 1 to 50 of Table 1.2 having a negative fc value, and/or

a higher expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 1.2 having a positive fc value,

is indicative for the presence of AML_Double when AML_Double is
distinguished from all other subtypes,

and/or wherein 

a lower expression of at least one polynucleotide defined by at least one of the 
numbers 1 to 50 of Table 1.3 having a negative fc value, and/or

a higher expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 1.3 having a positive fc value,

is indicative for the presence of AML_Status-1 when AML_Status-1 is
distinguished from all other subtypes,

and/or wherein 



a lower expression of at least one polynucleotide defined by at least one of the 
numbers 1 to 50 of Table 1.4 having a negative fc value, and/or

a higher expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 1.4 having a positive fc value,

is indicative for the presence of AML_Status-2 when AML_Status-2 is
distinguished from all other subtypes,

and/or wherein 

a lower expression of at least one polynucleotide defined by at least one of the 
numbers 1 to 50 of Table 1.5 having a negative fc value, and/or 

a higher expression of at least one polynucleotide defined by at least one of the 
numbers 1 to 50 of Table 1.5 having a positive fc value,

is indicative for the presence of AML_Status-3 when AML_Status-3 is
distinguished from all other subtypes,

and/or wherein 

a lower expression of at least one polynucleotide defined by at least one of the 
numbers 1 to 50 of Table 1.6 having a negative fc value, and/or 

a higher expression of at least one polynucleotide defined by at least one of the 
numbers 1 to 50 of Table 1.6 having a positive fc value, 

is indicative for the presence of AML_Status-4 when AML_Status-4 is
distinguished from all other subtypes,

and/or wherein 

a lower expression of at least one polynucleotide defined by at least one of the 
numbers 1 to 50 of Table 1.7 having a negative fc value, and/or 

a higher expression of at least one polynucleotide defined by at least one of the 
numbers 1 to 50 of Table 1.7 having a positive fc value,

is indicative for the presence of AML_Status-5 when AML_Status-5 is
distinguished from all other subtypes,

and/or wherein 

a lower expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 1.8 having a negative fc value, and/or

a higher expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 1.8 having a positive fc value,

is indicative for the presence of AML_normal when AML_normal is
distinguished from all other subtypes,

and/or wherein

a lower expression of at least one polynucleotide defined by at least one of the 
numbers 1 to 50 of Table 2.1 having a negative fc value, and/or

a higher expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 2.1 having a positive fc value,

is indicative for the presence of AML_D835 when AML_D835 is distinguished
from AML_Double,

and/or wherein 

a lower expression of at least one polynucleotide defined by at least one of the 
numbers 1 to 50 of Table 2.2 having a negative fc value, and/or

a higher expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 2.2 having a positive fc value,

is indicative for the presence of AML_D835 when AML_D835 is distinguished
from AML_Status-1,

and/or wherein 

a lower expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 2.3 having a negative fc value, and/or

a higher expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 2.3 having a positive fc value,

is indicative for the presence of AML_D835 when AML_D835 is distinguished
from AML_Status-2,

and/or wherein 

a lower expression of at least one polynucleotide defined by at least one of the 
numbers 1 to 50 of Table 2.4 having a negative fc value, and/or

a higher expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 2.4 having a positive fc value,



is indicative for the presence of AML_D835 when AML_D835 is distinguished 
from AML_Status-3, 

and/or wherein 

a lower expression of at least one polynucleotide defined by at least one of the 
numbers 1 to 50 of Table 2.5 having a negative fc value, and/or

a higher expression of at least one polynucleotide defined by at least one of the 
numbers 1 to 50 of Table 2.5 having a positive fc value, 

is indicative for the presence of AML_D835 when AML_D835 is distinguished 
from AML_Status-4, 

and/or wherein

a lower expression of at least one polynucleotide defined by at least one of the 
numbers 1 to 50 of Table 2.6 having a negative fc value, and/or 

a higher expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 2.6 having a positive fc value,

is indicative for the presence of AML_D835 when AML_D835 is distinguished
from AML_Status-5,

and/or wherein 

a lower expression of at least one polynucleotide defined by at least one of the 
numbers 1 to 50 of Table 2.7 having a negative fc value, and/or 

a higher expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 2.7 having a positive fc value,

is indicative for the presence of AML_D835 when AML_D835 is distinguished
from AML_normal,

and/or wherein 

a lower expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 2.8 having a negative fc value, and/or

a higher expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 2.8 having a positive fc value,

is indicative for the presence of AML_Double when AML_Double is
distinguished from AML_Status-1,



and/or wherein 



a lower expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 2.9 having a negative fc value, and/or 

a higher expression of at least one polynucleotide defined by at least one of the 
numbers 1 to 50 of Table 2.9 having a positive fc value, 

is indicative for the presence of AML_Double when AML_Double is 
distinguished from AML_Status-2, 

and/or wherein 

a lower expression of at least one polynucleotide defined by at least one of the 
numbers 1 to 50 of Table 2.10 having a negative fc value, and/or

a higher expression of at least one polynucleotide defined by at least one of the 
numbers 1 to 50 of Table 2.10 having a positive fc value, 

is indicative for the presence of AML_Double when AML_Double is
distinguished from AML_Status-3,

and/or wherein 

a lower expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 2.11 having a negative fc value, and/or

a higher expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 2.11 having a positive fc value,

is indicative for the presence of AML_Double when AML_Double is
distinguished from AML_Status-4,

and/or wherein 

a lower expression of at least one polynucleotide defined by at least one of the 
numbers 1 to 50 of Table 2.12 having a negative fc value, and/or 

a higher expression of at least one polynucleotide defined by at least one of the 
numbers 1 to 50 of Table 2.12 having a positive fc value, 

is indicative for the presence of AML_Double when AML_Double is
distinguished from AML_Status-5,

and/or wherein 



a lower expression of at least one polynucleotide defined by at least one of the 
numbers 1 to 50 of Table 2.13 having a negative fc value, and/or 

a higher expression of at least one polynucleotide defined by at least one of the 
numbers 1 to 50 of Table 2.13 having a positive fc value, 

is indicative for the presence of AML_Double when AML_Double is 
distinguished from AML_normal,

and/or wherein

a lower expression of at least one polynucleotide defined by at least one of the 
numbers 1 to 50 of Table 2.14 having a negative fc value, and/or 

a higher expression of at least one polynucleotide defined by at least one of the 
numbers 1 to 50 of Table 2.14 having a positive fc value, 

is indicative for the presence of AML_Status-1 when AML_Status-1 is
distinguished from AML_Status-2,

and/or wherein 

a lower expression of at least one polynucleotide defined by at least one of the 
numbers 1 to 50 of Table 2.15 having a negative fc value, and/or 

a higher expression of at least one polynucleotide defined by at least one of the 
numbers 1 to 50 of Table 2.15 having a positive fc value, 

is indicative for the presence of AML_Status-1 when AML_Status-1 is
distinguished from AML_Status-3,

and/or wherein 

a lower expression of at least one polynucleotide defined by at least one of the 
numbers 1 to 50 of Table 2.16 having a negative fc value, and/or 

a higher expression of at least one polynucleotide defined by at least one of the 
numbers 1 to 50 of Table 2.16 having a positive fc value, 

is indicative for the presence of AML_Status-1 when AML_Status-1 is
distinguished from AML_Status-4,

and/or wherein 

a lower expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 2.17 having a negative fc value, and/or 



a higher expression of at least one polynucleotide defined by at least one of the 
numbers 1 to 50 of Table 2.17 having a positive fc value, 

is indicative for the presence of AML_Status-1 when AML_Status-1 is
distinguished from AML_Status-5,

and/or wherein 

a lower expression of at least one polynucleotide defined by at least one of the 
numbers 1 to 50 of Table 2.18 having a negative fc value, and/or 

a higher expression of at least one polynucleotide defined by at least one of the 
numbers 1 to 50 of Table 2.18 having a positive fc value, 

is indicative for the presence of AML_Status-1 when AML_Status-1 is
distinguished from AML_normal, 

and/or wherein 

a lower expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 2.19 having a negative fc value, and/or

a higher expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 2.19 having a positive fc value,

is indicative for the presence of AML_Status-2 when AML_Status-2 is 
distinguished from AML_Status-3, 

and/or wherein 

a lower expression of at least one polynucleotide defined by at least one of the 
numbers 1 to 50 of Table 2.20 having a negative fc value, and/or 

a higher expression of at least one polynucleotide defined by at least one of the 
numbers 1 to 50 of Table 2.20 having a positive fc value, 

is indicative for the presence of AML_Status-2 when AML_Status-2 is
distinguished from AML_Status-4,

and/or wherein 

a lower expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 2.21 having a negative fc value, and/or

a higher expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 2.21 having a positive fc value,

is indicative for the presence of AML_Status-2 when AML_Status-2 is
distinguished from AML_Status-5,

and/or wherein 

a lower expression of at least one polynucleotide defined by at least one of the 
numbers 1 to 50 of Table 2.22 having a negative fc value, and/or

a higher expression of at least one polynucleotide defined by at least one of the 
numbers 1 to 50 of Table 2.22 having a positive fc value, 

is indicative for the presence of AML_Status-2 when AML_Status-2 is 
distinguished from AML_normal,

and/or wherein

a lower expression of at least one polynucleotide defined by at least one of the 
numbers 1 to 50 of Table 2.23 having a negative fc value, and/or

a higher expression of at least one polynucleotide defined by at least one of the 
numbers 1 to 50 of Table 2.23 having a positive fc value, 

is indicative for the presence of AML_Status-3 when AML_Status-3 is
distinguished from AML_Status-4,

and/or wherein 

a lower expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 2.24 having a negative fc value, and/or

a higher expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 2.24 having a positive fc value,

is indicative for the presence of AML_Status-3 when AML_Status-3 is 
distinguished from AML_Status-5, 

and/or wherein 

a lower expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 2.25 having a negative fc value, and/or

a higher expression of at least one polynucleotide defined by at least one of the
numbers 1 to 50 of Table 2.25 having a positive fc value,

is indicative for the presence of AML_Status-3 when AML_Status-3 is
distinguished from AML_normal,



and/or wherein 



a lower expression of at least one polynucleotide defined by at least one of the 
numbers 1 to 50 of Table 2.26 having a negative fc value, and/or 

a higher expression of at least one polynucleotide defined by at least one of the 
numbers 1 to 50 of Table 2.26 having a positive fc value, 

is indicative for the presence of AML_Status-4 when AML_Status-4 is 
distinguished from AML_Status-5,



and/or wherein 



a lower expression of at least one polynucleotide defined by at least one of the 
numbers 1 to 50 of Table 2.27 having a negative fc value, and/or 

a higher expression of at least one polynucleotide defined by at least one of the 
numbers 1 to 50 of Table 2.27 having a positive fc value,

is indicative for the presence of AML_Status-4 when AML_Status-4 is
distinguished from AML_normal,



and/or wherein 



a lower expression of at least one polynucleotide defined by at least one of the 
numbers 1 to 50 of Table 2.28 having a negative fc value, and/or 

a higher expression of at least one polynucleotide defined by at least one of the 
numbers 1 to 50 of Table 2.28 having a positive fc value, 

is indicative for the presence of AML_Status-5 when AML_Status-5 is 
distinguished from AML_normal.



2. The method according to claim 1, wherein the polynucleotide is labelled.



3. The method according to claim 1 or 2, wherein the label is a luminescent, 
preferably a fluorescent label, an enzymatic or a radioactive label. 



4. The method according to at least one of the claims 1-3, wherein the expression level
of at least two, preferably of at least ten, more preferably of at least 25, most
preferably of 50 of the markers of at least one of the Tables 1-2 is determined.

5. The method according to at least one of the claims 1-4, wherein the expression
level of markers expressed lower in a first subtype than in at least one second
subtype, which differs from the first subtype, is at least 5%, 10% or 20%, more
preferred at least 50% or may even be 75% or 100%, i.e. 2-fold lower, preferably at
least 10-fold, more preferably at least 50-fold, and most preferably at least 100-fold
lower in the first subtype.

6. The method according to at least one of the claims 1-4, wherein the expression
level of markers expressed higher in a first subtype than in at least one second
subtype, which differs from the first subtype, is at least 5%, 10% or 20%, more
preferred at least 50% or may even be 75% or 100%, i.e. 2-fold higher, preferably
at least 10-fold, more preferably at least 50-fold, and most preferably at least
100-fold higher in the first subtype.

7. The method according to at least one of the claims 1-6, wherein the sample is from
an individual having AML.

8. The method according to at least one of the claims 1-7, wherein at least one
polynucleotide is in the form of a transcribed polynucleotide, or a portion thereof.

9. The method according to claim 8, wherein the transcribed polynucleotide is a
mRNA or a cDNA.

10. The method according to claim 8 or 9, wherein the determining of the expression
level comprises hybridizing the transcribed polynucleotide to a complementary
polynucleotide, or a portion thereof, under stringent hybridization conditions.

11. The method according to at least one of the claims 1-7, wherein at least one
polynucleotide is in the form of a polypeptide, or a portion thereof.
12. The method according to at least one of the claims 8, 9 or 12, wherein the
determining of the expression level comprises contacting the polynucleotide or the
polypeptide with a compound specifically binding to the polynucleotide or the
polypeptide.



13. The method according to claim 12, wherein the compound is an antibody, or a 
fragment thereof. 



14. The method according to at least one of the claims 1-13, wherein the method is
carried out on an array. 



15. The method according to at least one of the claims 1-14, wherein the method is
carried out in a robotics system. 



16. The method according to at least one of the claims 1-15, wherein the method is
carried out using microfluidics. 



17. Use of at least one marker as defined in at least one of the claims 1-3 for the
manufacturing of a diagnostic for distinguishing AML-specific FLT3 length
mutations from TKD mutations.



18. The use according to claim 17 for distinguishing AML_MLL, t(15;17), t(8;21),
inv(16), 11q23, de novo_AML, s_AML, t_AML, AML_M0, AML_M1, AML_M2,
AML_M4, AML_M5a, AML_M5b, AML_M6, AML_t(15;17)/M3 and/or
AML_t(15;17)/M3v in an individual having AML.



19. A diagnostic kit containing at least one marker as defined in at least one of the 
claims 1-3 for distinguishing AML-specific FLT3 length mutations from TKD
mutations, in combination with suitable auxiliaries.
20. The diagnostic kit according to claim 19, wherein the kit contains at least one
reference for the AML-specific FLT3 length mutations and/or TKD mutations.

21. The diagnostic kit according to claim 20, wherein the reference is a sample or a
databank.

22. An apparatus for distinguishing AML-specific FLT3 length mutations from TKD
mutations in a sample containing a reference data bank.

23. The apparatus according to claim 22, wherein the reference data bank is obtainable
by comprising

(a) compiling a gene expression profile of a patient sample by determining the
expression level of at least one marker selected from the markers
identifiable by their Affymetrix Identification Numbers (affy ID) as defined
in Tables 1, and/or 2, and

(b) classifying the gene expression profile by means of a machine learning
algorithm.

24. The apparatus according to claim 23, wherein the machine learning algorithm is
selected from the group consisting of Weighted Voting, K-Nearest Neighbors,
Decision Tree Induction, Support Vector Machines, and Feed-Forward Neural
Networks, preferably Support Vector Machines.

25. The apparatus according to at least one of the claims 22-24, wherein the apparatus
contains a control panel and/or a monitor.

26. A reference data bank for distinguishing AML-specific FLT3 length mutations
from TKD mutations obtainable by comprising

(a) compiling a gene expression profile of a patient sample by determining the
expression level of at least one marker selected from the markers
identifiable by their Affymetrix Identification Numbers (affy ID) as defined
in Tables 1, and/or 2, and

(b) classifying the gene expression profile by means of a machine learning
algorithm.

27. The reference data bank according to claim 26, wherein the reference data bank is
backed up and/or contained in a computational memory chip.

F. Hoffmann-La Roche AG

Abstract

Disclosed is a method for distinguishing AML-specific FLT3 length mutations from TKD
mutations in a sample by determining the expression level of markers, as well as a
diagnostic kit and an apparatus containing the markers.